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[57] ABSTRACT 

A method and apparatus for executing queries on a set of 
data that has been partitioned into a plurality of partitions 
based on a partitioning key is provided. A query is received 
that includes a reference to a second key. The second key is 
not part of the partitioning key but has a predetermined 
correlation with the partitioning key. This second key is 
referred to as an overlapping partition key. A subset of the 
plurality of partitions is selected to be scanned based on the 
reference to the second key and the predetermined correla- 
tion with the partitioning key. The query is then executed by 
scanning only those partitions of the plurality of partitions 
that belong to the subset of partitions. The overlapping 
partition key provides for reduced query execution time 
even when the partitioning key is not directly involved in the 
query. Specifically, the overlapping partition key permits a 
partial table scan in situations that would require a fill table 
scan with partitioning alone. 
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USING OVERLAPPING PARTITIONS OF ent in this partitioning method. The value of partitioning to 

DATA FOR QUERY OPTIMIZATION query optimization is limited to situations where the query 

uses the partitioning key directly. 
FIELD OF THE INVENTION It is desirable, therefore, to provide reduced query execu- 
te invention relates generally to the field of database 5 ^ ^ even when the partitioning key is not directly 
°, a ii t u * «• involved in the query. It is also desirable to maintain and 
management systems. More specifically, the invention ^ ^tton predicales 0I1 something other than 
relates to query optimization. ^ partitioning key that can be ^ d by ^ query optimizer 

BACKGROUND OF THE INVENTION to treat individual partitions differently. When values in a set 

10 of one or more columns are correlated to the columns of the 

When a relational database has very large tables consist- partitioning key, it is desirable to take advantage of this 

ing of potentially millions of rows, for example, it is correlation to optimize queries, 
desirable to divide the tables into sub tables ("partitions") of 

a more manageable size. The operation of dividing a table SUMMARY OF THE INVENTION 

into partitions is typically accomplished with a partitioning A method and apparatus for executing queries on a set of 

key, A partitioning key is a key that is used to determine to 15 data that has been partitioned into a plurality of partitions 

which partition a particular record (row) belongs. The par- based on a first key is disclosed. A query is received that 

titioning key can be defined in terms of one or more includes a reference to a second key. The second key is not 

attributes (columns) of the table. The resulting partitioned part of the first key but has a predetermined correlation with 

table consists of several partitions, each containing the same the first key. Asubset of the plurality of partitions is selected 

columns as the partitioned table. However, each partition 20 to be scanned based on the reference to the second key and 

contains only a subset of the rows of the partitioned table. the predetermined correlation with the first key. The query is 

Partitioning a table has a positive effect on query pro- then executed by scanning only those partitions of the 

cessing. By using information regarding the partitioning plurality of partitions that belong to the subset of partitions, 

scheme of a table, the amount of time required to execute a According to one aspect of the invention, a set of query 

query that accesses the table may be reduced. As will be predicates is determined from the query. A set of selection 

illustrated further with reference to FIG. 1, partitioning predicates is created and initialized with the set of query 

allows a query to be processed with a partial table scan predicates. One or more predicates that represent the second 

rather than a full table scan if the query involves a predicate key's predetermined correlation with the first key are added 

containing the partitioning key. If the query involves the 3Q to the set of selection predicates. Then, the set of selection 

partitioning key, the number of partitions that need to be predicates is used to determine the subset of partitions to be 

searched may be reduced prior to executing the query. For scanned. 

example, the query optimizer can generate a query plan that According to another aspect of the invention, after the 

excludes partitions that cannot possibly contain rows that predicates have been added to the set of selection predicates, 

satisfy the user specified conditions. 35 new predicates are transitively generated based on the 

FIG. 1 is a simple table with four columns: order #, ship predicates in the set of selection predicates. The new predi- 

date, receive date, and items. The table is partitioned by the cates are added to the set of selection predicates, 

ship date column. More specifically, the table is partitioned Then, those of the one or more predicates that represent 

by the year of the ship date. Therefore, records having the the correlation between the first and second key and the 

same ship date year are stored in the same partition. The 40 newly generated predicates that are not constant predicates 

table contains n partitions, partition 1 contains records are removed from the set of selection predicates. Also, any 

having a ship date year of 1968, partition 2 contains records of the newly generated predicates that can produce a result 

with a ship date year of 1969, partition 3 contains records of UNKNOWN are removed from the set of selection 

with a ship date year of 1970, partitions 4 through n-1 (not predicates. 

shown) contain ship date years 1971 through 1995, and 45 Selecting the subset of partitions to scan is accomplished 

partition n contains records with a ship date year of 1996. by evaluating the constant predicates in the set of selection 

To illustrate how partitioning can be used to reduce query predicates. The partitions that are included in the subset of 

execution time, assume the user has requested to see a list of partitions to be scanned are those partions that correspond to 

records with fifty or more items and shipped in the year selection predicates containing constant predicates that 

1969. In this example, the first partition can be skipped 50 evaluate to TRUE or UNKNOWN, 

altogether because it contains records associated with ship BRIEF DESCRIPTION OF THE DRAWINGS 
dates occurring in the year 1968. Partitions 3 through n can 

also be skipped because none contain records having a ship ^ P resent invention is illustrated by way of example, 

date year of 1969. The query plan generated by the query and not bv wa y of l™itaUon, ™ the fig^s of tne accom ' 

optimizer accordingly can be limited to a search of partition 55 P^g drawings and in which like reference numerals refer 


2. 


to similar elements and in which: 


As illustrated above, partitioning is useful for reducing FIG - 1 is a lable divided into partitions by a partitioning 

the amount oftime required to execute a query. However, the ^ey. 

advantages of this technique can only be enjoyed when the FIG. 2 is an example of a typical computer system upon 

query contains a predicate directly involving the columns in 60 which one embodiment of the present invention can be 

the partitioning key. Referring again to FIG. 1, if the user implemented. 

had requested a list of records having 50 or more items and FIG. 3 is a table partitioned as the table of FIG. 1 with the 

a receive date in 1970, then the query optimirer would be addition of an overlapping partition key according to one 

unable to eliminate any of the partitions from the query plan. embodiment of the present invention. 

As a result, all of the partitions would have to be searched. 65 FIG. 4 is a flow diagram illustrating a method of execut- 

Therefore, in this example, partitioning has provided no ing queries according to one embodiment of the present 

benefit at all. This example illustrates the limitations inher- invention. 
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FIG. 5 is a flow diagram illustrating a method of selecting 
a subset of partitions using an overlapping partitioning key 
(step 435 of FIG. 4) according to one embodiment of the 
present invention. 

FIG. 6 is an illustration of a database management system 
using check constraints to implement overlapping partition 
keys according to one embodiment of the present invention. 

FIG. 7 is an illustration of a database management system 
using check constraints to implement overlapping partition 
keys according to another embodiment of the present inven- 
tion. 

FIG. 8 is an illustration of a database management system 
using an index to implement overlapping partition keys 
according to one embodiment of the present invention. 

FIG. 9 is an illustration of a database management system 
using partition value arrays according to one embodiment of 
the present invention. 

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENT 

A method and apparatus for recognizing overlapping 
partition keys (OPkeys) and telling the query optimizer 
about them is described. In the following description, for the 
purposes of explanation, numerous specific details are set 
forth in order to provide a thorough understanding of the 
present invention. It will be apparent, however, to one 
skilled in the art that the present invention may be practiced 
without these specific details. In other instances, well-known 
structures and devices are shown in block diagram form in 
order to avoid unnecessarily obscuring the present inven- 
tion. 

Hardware Overview 

Referring to FIG. 2, a computer system is shown as 200. 
The computer system 200 represents a node (site) within a 
distributed processing system upon which the preferred 
embodiment of the present invention can be implemented. 
The hardware architecture of nodes within the distributed 
processing system can be varied and diverse. There is no 
requirement in the present invention that each node have 
equivalent and compatible processing systems. It is only 
necessary that each node of the distributed processing sys- 
tem be able to communicate on a network or some commu- 
nication path coupling the nodes together. Computer system 
200 comprises a bus or other communication means 201 for 
communicating information, and a processing means 202 
coupled with bus 201 for processing information. Computer 
system 200 further comprises a random access memory 
(RAM) or other dynamic storage device 204 (referred to as 
main memory), coupled to bus 201 for storing information 
and instructions to be executed by processor 202. Main 
memory 204 also may be used for storing temporary vari- 
ables or other intermediate information during execution of 
instructions by processor 202. Computer system 200 also 
comprises a read only memory (ROM) and/or other static 
storage device 206 coupled to bus 201 for storing static 
information and instructions for processor 202. Data storage 
device 207 is coupled to bus 201 for storing information and 
instructions. 

A data storage device 207 such as a magnetic disk or 
optical disc and its corresponding drive can be coupled to 
computer system 200. Computer system 200 can also be 
coupled via bus 201 to a display device 221, such as a 
cathode ray tube (CRT), for displaying information to a 
computer user. An alphanumeric input device 222, including 
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alphanumeric and other keys, is typically coupled to bus 201 
for communicating information and command selections to 
processor 202. Another type of user input device is cursor 
control 223, such as a mouse, a trackball, or cursor direction 

5 keys for communicating direction information and com- 
mand selections to processor 202 and for controlling cursor 
movement on display 221. This input device typically has 
two degrees of freedom in two axes, a first axis (e.g., x) and 
a second axis (e.g., y), which allows the device to specify 

10 positions in a plane. 

Alternatively, other input devices such as a stylus or pen 
can be used to interact with the display. A displayed object 
on a computer screen can be selected by using a stylus or pen 
to touch the displayed object. The computer detects the 

15 selection by implementing a touch sensitive screen. 
Similarly, a light pen and a light sensitive screen can be used 
for selecting a displayed object. Such devices may thus 
detect selection position and the selection as a single opera- 
tion instead of the "point and click," as in a system incor- 

20 poraling a mouse or trackball. Stylus and pen based input 
devices as well as touch and light sensitive screens are well 
known in the art. Such a system may also lack a keyboard 
such as 222 wherein all interface is provided via the stylus 
as a writing instrument (like a pen) and the written text is' 

25 interpreted using optical character recognition (OCR) tech- 
niques. 

Another device which may optionally be coupled to bus 
201 is a hard copy device 224 which may be used for 
printing instructions, data or other information on a medium 

30 such as paper, film, or similar types of media. In the 
preferred embodiment, a communication device 225 is 
coupled to bus 201 for use in accessing other nodes of the 
distributed system via a network. The communication 
device 225 may include any of a number of commercially 

35 available networking peripheral devices such as those used 
for coupling to an Ethernet, token ring, Internet, or wide area 
network. Note that any or all of the components of the 
system illustrated in FIG. 2 and associated hardware may be 
used in various embodiments of the present invention; 

40 however, it will be appreciated by those of ordinary skill in 
the art that any configuration of the system may be used for 
various purposes according to the particular implementation. 
The present invention is related to the use of computer 

45 system 200 to reduce the execution time of queries executed 
on a set of data. As computer system 200 executes a 
program, the processor 202 accesses data stored in a data 
dictionary within main memory 204 to facilitate query 
optimization. Importantly, the present invention is not lim- 

5Q ited to having all partitions located on the same computer 
system. Rather, the partitions might be spread over several 
databases in a distributed network of databases. 

Overlapping Partition Keys 

55 As discussed earlier, dividing a table into partitions allows 
query execution time to be reduced by removing from 
consideration those partitions that cannot possibly contain 
rows that satisfy specified query conditions. However, this 
benefit can only be achieved when the query directly 

60 involves the partitioning key. Therefore, it is desirable to 
provide a solution that can reduce query execution time even 
when the partitioning key is not directly involved in the 
query. 

An overlapping partition key (OPkey) is a set of columns 
65 in a partitioned table whose values are strongly correlated to 
the partitioning key of that table. Overlapping partition keys 
allow the system to remove from consideration many of the 
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table's partitions at compile time. OPkeys permit partial At step 410, a query is received. The query is parsed in 

table scans in situations that would require full table scans step 415. At step 416, a set, X, representing the partitions 

with partitioning alone. This advancement is illustrated with selected for query execution is initialized to include all the 

reference to FIG. 3. table's partitions. Transitive predicate generation is per- 

F1G. 3 is a table partitioned as the table of FIG. 1 with the 5 formed in step 420. The generation of new predicates and 

addition of an overlapping partition key according to one transitive predicate generation shall be described in greater 

embodiment of the present invention. The only difference detail below. In step 425, it is determined if the query 

between FIG. 1 and FIG. 3 is the designation of the receive predicates involve the partitioning key. 

date column as an overlapping partition key. A previous _ r „ , . , . , 

example, discussed with resect to FIG. 1, involved a user 1fl If ! he . ^ ? ate K determ !? ed t £ 

request for a list of records having 50 or more items and a 10 Phoning key, X is set to a subset of the table s partitions 

receive date in 1970. That example illustrated a query that US1D S Petitioning key in step 445. This selection can be 

would require a full table scan when only a partitioning key accomplished by choosing only the partitions that could 

is used. The same query on the table of FIG. 3, however, can possibly contain the values involved in the query predicates, 

be executed in a fraction of the time by using a known Since a query might refer to both keys, upon completion of 

relationship between the partitioning key and the OPkey. In 15 step 445 processing continues at step 430. 

the illustrated table, the values stored in the receive date Returning to step 425, if the partitioning key is not 

column are highly correlated to the corresponding values involved in the query predicates, X remains set to all the 

stored in the ship date column. Specifically, values in the table's partitions and the method continues at step 430. 

receive date column are always greater than or equal to the At step 430, it is determined whether or not the query 

corresponding values in the ship date column and the receive 20 predicates involve the OPkey. If the query predicates involve 

date is within three months of the corresponding ship date. the OPkey, a subset of the partitions in the set X is selected 

The database management system can take advantage of using the OPkey. This selection can be performed with 

the OPkey's relationship to the partitioning key to optimize reference to the known correlation between the OPkey and 

queries. For example, to find the records having 50 or more the partitioning key and the query predicates. For example, 

items and a receive date in 1970, only partitions having a a table for a telephone directory partitioned on area codes 

ship date that can meet the known relationship given the fact might have a name column and a zip code column. The name 

that the receive data is in 1970 need to be scanned. column is relatively independent of the partitioning key, area 

Therefore, in this example, those partitions whose highest code. While the zip code column is highly correlated to the 

ship date is less than 10/1/69 (3 months prior to 1/1/70) can 3Q area code column. Once the user observes the correlation 

be removed from consideration. Accordingly, partition 1 of between the area code and the zip code columns, the user can 

FIG. 3 does not need to be scanned. None of the ship dates define a set of rules or correlation predicates to describe the 

in partition 1 can satisfy the known relationship because relationship. Then, a query involving a specific zip code can 

none of the ship dates in this partition are within 3 months be limited to a scan of the partitions containing area codes 

of 1970. 35 that meet the defined relationship. 

Also, those partitions whose lowest ship date is greater Returning to step 430, if it is determined that the query 
than 12/31/70 can be removed from consideration. predicates do not involve the OPkey, then X remains unal- 
Accordingly, partitions 4 through n do not need to be tered. Since X was initialized to contain all the table's 
scanned. None of the ship dates in partitions 4 through n can partitions, if the query predicates involve neither the parti- 
satisfy the known relationship because all of the ship dates ^ tioning key nor the OPkey, then all the partitions remain 
in these partitions have a ship date that is greater than 1970. selected when processing flows to step 450. 

In this example, therefore, only partition 2 and partition 3 After the partitions have been selected, the query is 

need to be scanned to find the requested records. As this executed upon the selected partitions in set X at step 450. 

example illustrates, not every search involving the OPkey Importantly, not all of the selected partitions will have to be 

can be limited to a single partition. Generally, the OPkey's 45 searched. Those partitions that can be evaluated to FALSE 

partition of rows is neither a finer nor a coarser partition of in constant time can be skipped. Thus, some partitions can 

the partitioning key's partition. From this perspective, the still be optimized away during this execution phase. 

OPkey can be thought of as producing another set of Although OPkeys will now be described with reference to 

partitions overlapping the partitions created by the partition- specific embodiments, many alternative embodiments are 

ing key; hence, the name "overlapping partition key." 50 available for implementing the query execution technique 

In the following discussion, embodiments of the invention using OPkeys described above. Therefore, the specific 

shall be described with reference to a table that has been embodiments described should be regarded in an illustrative 

partitioned. However, the format in which the partitioned rather than a restrictive sense, 

data is stored will vary from implementation to implemen- Selection Predicates 

tation. For example, one or more attributes of an object type 55 A selection predicate, sel ( ., is a logical expression that 

may be used as an OPkey for searches performed on a set of provides a way of excluding the corresponding partition 

objects of the object type that have been grouped from the query plan at compile time ("optimizing away"), 

(partitioned) according to a different set of one or more Generally, selection predicates are built by combining three 

attributes of the object type. Consequently, the present sets of predicates: (1) partitioning predicates, (2) query 

invention is not limited to the use of OPkeys on partitions eo predicates, and (3) correlation predicates. The steps involved 

that group any particular type or format of data. in building selection predicates from the three sets of 

FIG. 4 is a flow diagram illustrating a method of execut- predicates and applying the selection predicates will be 

ing queries according to one embodiment of the present discussed in detail below. 

invention. It is assumed that a set of data, here a table, has Partition i can be excluded from the query plan, if, at 

been divided into a plurality of partitions by a partitioning 65 anytime during the building of selection predicates, it is 

key. Another presumption is that an OPkey has been defined determined that the corresponding selection predicate, sel,-, 

that has a strong correlation with the partitioning key. must evaluate to FALSE for all the rows in partition i. For 
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example, if the selection predicate for partition 1 of FIG. 3, the value to be substituted for the host variable in the 

was reduced to only constant predicates (4/1 9/68 >= 1/1/68) expression. In the example predicates above, routines would 

and (4/1 9/68<= 3/31/69), then the result would be TRUE for be invoked to perform the index traversal to return the high 

every row in partition 1. If the selection predicate for or low value for a given partition. 

partition 2 was reduced to constant predicates (4/19/68>=l/ 5 Referring now to FIG. 5, a method of selecting a subset 

1/69) and (4/1 9/68<=3/3 1/70), then the result would be of partitions using an overlapping partitioning key (step 435 

FALSE for every row in partition 2. Therefore, in this of FIG. 4) according to one embodiment of the present 

example, partition 2 would be excluded from the query plan invention is illustrated by a flow diagram. First, each of the 

at compile time and partition 1 would remain in the query steps is briefly described. Then, a concrete example is 

plan to be evaluated at run-time. 10 discussed with reference to FIG. 6 and FIG. 3. 

Per-partition Predicates In one embodiment, a list of selection predicates is 

Per-partition predicates are separate and independent evaluated to determine a subset of partitions upon which the 

predicates that are stored for each partition of the table. query will be executed. At step 520, the list of selection 

Per-partition predicates may include both primary and sec- predicates is initialized with the query predicates. Per- 

ondary partitioning predicates. For example, one predicate 15 partition predicates are loaded at step 525. 

corresponding to the primary partitioning key for partition 1 Next, At step 530, the per-partition predicates are 

of FIG. 3 might be (ship date>=l/l/68). appended to the list of selection predicates. New predicates 

The correlation predicates that define the relationship are generated based on the per-partition predicates and the 

between the OPkey and the primary partitioning key result query predicates, step 535. Transitivity is one of many 

in secondary partitionings of the table. Generally, secondary 20 properties that can be used to generate new predicates based 

partitionings can be specified in one of two ways. First, a on known predicates. For example, transitive predicate 

secondary partitioning key might be specified by a set of generation produces the new predicate (a=c) if (a=b) and 

predicates, S ( -, for each partition. Also, a secondary parti- (b=c) are known predicates. 

tioning key might be defined by a single predicate, S, that The newly generated predicates are added to the list of 

relates the secondary partitioning key to another partitioning 25 selection predicates at step 540. After generating all possible 

key. It is not necessary that all n S,- be explicit. It is possible predicates, all nonconstant per-partition predicates and all 

that a single constraint, involving a reference to the primary newly generated predicates that are not constant predicates 

partitioning key can be specified. Then transitivity and are removed from the selection predicate list, step 545. A 

simple arithmetic manipulation can automatically be used to constant predicate is one that must evaluate to the same 

infer the S f . For example, (received date>-ship date) and 30 value for every row. Therefore, constant predicates can be 

(received date<-ship date+90 days) can be used to generate evaluated at run-time in constant time independent of the 

the n received date partitioning predicates by substituting the number of rows to be fetched. 

lowest and highest ship dates for each partition into the Instep 550, any of the newly generated predicates that can 

single constraint. Further, the S t - can be avoided altogether evaluate to the constant UNKNOWN are discarded. For 

by using transitivity to rewrite the query predicate to refer- 35 example, the predicate (1-NULL) evaluates to 

ence ship date instead of received date. For example, a query UNKNOWN, but the predicate (1 <-:high) does not evaluate 

predicate (received date-x) can be turned into (ship date<- to UNKNOWN if the value of high is known not to be 

x) and (ship date>-x-90)— an exact match is turned into a NULL. 

range query. In step 551, redundant query predicates are discarded by 
The secondary partitioning predicates (correlation 40 removing them from the selection predicate list. Query 
predicates) can be based upon many alternative representa- predicates can be removed if, given the primary and sec- 
tions of the known relationship between the OPkey and the ondary partitioning predicates, the query predicates are 
primary partitioning key. For example, the predicates might redundant. For example, if the query, Q, is 7/l/67<«ship 
be represented with a check constraint for each partition or date< =7/1/70 and P,- is 1/1/68 <=ship date <» 12/3 1/68, then 
a single check constraint with per-partition constraint tem- 45 the terms of Q may be removed from the corresponding 
plates. selection predicate, se^. 

Check constraints are logical expressions that define a At this point, only original, nonredundant query predi- 

range of values that are acceptable for a column within a cates and new constant predicates remain in the fist of 

corresponding set of rows. For example, the check constraint selection predicates. By keeping only new constant 

(items>«0) might be used to ensure that no row is added to 50 predicates, conflicts between the operation of the Structured 

the table with a value below zero. A constraint template is a Query Language (SQL) where-clause and check constraints 

place holder in a logical expression that will have a data can be avoided. Check constraints accept rows if the rows do 

value substituted for it prior to the evaluation of the expres- not evaluate to FALSE. The SQL where-clause selects rows 

sion. For example, in the predicate (items<=max items), max if the rows evaluate to TRUE. Therefore, it is possible for a 

items is a variable that can have different values bound to it 55 row to evaluate to UNKNOWN, in which case the same 

for each partition. For partition 1, max items might be set to predicate would accept the row as a check constraint but 

100, and for partition 2, max items might be set to 200. reject the row as a where-clause. 

Other alternatives include the tracking of high and low In one embodiment, the per-partition predicates are based 

values for each partition for use in building predicates: upon a single check constraint for the entire partitioned table 

(OPkey<=:high) and (OPkey>=:low), or an index on the eo with per-partition constraint templates. At step 555, prior to 

OPkey could be traversed to find the high and low values for evaluating the selection predicates, the per-partition values 

each partition. Further, if the partitions are constrained to not are bound into the new constant predicates that remain in the 

overlap, an ordered list of high keys or low keys could be list of selection predicates. 

maintained for each partition. The variables ":high" and Next, at step 560, for each selection predicate correspond- 

":low" in the above predicates are referred to as host 65 ing to a partition that has not already been excluded, the 

variables. Host variables are evaluated during query execu- remaining constant predicates are evaluated. In one 

tion by calling a designated routine which in turn will return embodiment, it is determined if any of the constant selection 
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predicates evaluate to FALSE, step 565. If any of the 
constant selection predicates are FALSE, none of the rows in 
the corresponding partition need to be considered. 
Accordingly, such partitions are excluded from the query 
plan at step 575. 

If none of the constant selection predicates were found to 
be FALSE in step 565, the corresponding partition could not 
be optimized away during compile time and it will be 
included in the query plan, step 570. 

Thus, there are two possible outcomes for a given selec- 
tion predicate. First, a given selection predicate may be 
"optimized away" because it is known to be FALSE at 
compile time (steps 560, 565, and 575). In this case, a 
partition access operation is removed from the plan. 
Alternatively, if the selection predicate cannot be optimized 
away during the compile phase, a run-time test can be 
generated which, if FALSE, skips the corresponding parti- 
tion at run-time (step 450). 

Check Constraint-based Implementation of 
Overlapping Partition Keys 

Check constraints are typically used to enforce domain 
limitations on columns of a table. For example, the set of all 
acceptable values (domain) for a column that stores data 
values indicating age might be limited to positive integer 
values. Generally, the limitations are enforced at the time an 
update operation is attempted Prior to allowing an update 
operation on the table, a determination is made as to whether 
the record being modified or inserted complies with any 
specified domain limitations. For example, a column that 
stores values indicating gender might be limited to data 
values of "M " "F," "Male/ 1 and "Female." The check 
constraint for the gender column might be specified by the 
following predicates: (Gender«"M") or (Gender="F") or 
(Gender="Male") or (Gender="Female"), for example. If an 
update operation attempted to add a record having a gender 
data value other than those meeting the check constraint, the 
transaction would be rejected and an error message might 
inform the user of the allowable gender values. 

FIG. 6 is an illustration of a database management system 
using check constraints to implement overlapping partition 
keys according to one embodiment of the present invention. 
FIG. 6 shows a database 610 containing a partitioned table 
660 which has n partitions 670. The database includes a data 
dictionary 620. The data dictionary 620 has stored therein a 
list of n check constraints 630. The database also includes a 
list of selection predicates 650 that are used to select a subset 
of the partitions 670 for one or more query plans 690. 

According to one embodiment, check constraints in the 
list of check constraints 630 are used to enforce the corre- 
lation between the OPkey and the partitioning key. Each 
element in the list of check constraints has a corresponding 
partition in the partitioned table 660. An element in the list 
of check constraints 630 can be represented with predicates. 
For example, referring to FIG. 3, the check constraint 
predicates for partition 1 might be expressed as (receive 
date>=l/l/68) and (receive date<=3/31/69), the check con- 
straint predicates for partition 2 would be (receive date>« 
1A/69) and (receive date <-3/3 1/70), and the check con- 
straint predicates for partition n would be (receive date>- 
1/1/96) and (receive date<-3/31/97). 

The following example illustrates the method of FIG. 5 
using the check constraint implementation of FIG. 6 and the 
partitioned table of FIG. 3. Assume, after receipt of query 
640, parsing and transitive predicate generation results in 
query predicate (receive date«3/ll/69). Then, in this 
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example, all the elements of the list of selection predicates 
650 are initialized to (receive date=3/ll/69). Next, the check 
constraints are appended to the list of selection predicates 
650. Predicates (receive date>-l/l/68) and (receive date<- 

5 3/31/69) are added to the element of the list of selection 
predicates 650 corresponding to partition 1, sel r Predicates 
(receive date >» 1/1/69) and (receive da te< =3/3 1/70) are 
added to the element of the list of selection predicates 650 
corresponding to partition 2, sel^. Predicates (receive date>= 

10 1A/70) and (receive date<=3/3/71) are added to the element 
of the list of selection predicates 650 corresponding to 
partition 3, sel 3 . Predicates (receive date>«l/l/96) and 
(receive date<»3/31/97) are added to the element of the list 
of selection predicates 650 corresponding to partition n, sel„. 

15 Table 1 shows the state of the list of selection predicates 650 
after step 530. 


25 


TABLE 1 


J+ist of Selection Piedjcates 


Element 

Contents 


a 

2 
3 

(receive date ■ 3/11/69), (receive date >= 

(receive date <- 3/31/69) 

(receive date = 3/11/69), (receive date >= 

(receive date <= 3/31/70) 

(receive date - 3/11/69), (receive date >- 

(receive date 3/31/71) 

1/1/68) and 
1/1/69) and 
1/1/70) and 

n 

(receive date = 3/11/69), (receive date >- 
(receive date <= 3/31/97) 

1/1/96) and 


30 


New predicates are transitively generated based on the check 
constraits and the query predicates. New predicates (3/11/ 
69>=l/l/68) and (3/ll/69<=3/31/69) are added to sel^ new 
predicates (3/ll/69>=l/l/69) and (3/1 1/69 <=3/31/70) are 
35 added to sel^, new predicates (3/ll/69>=l/l/70) and (3/11/ 
69<=3/31/71) are added to sel 3 , and new predicates (3/11/ 
69>=l/l/96) and (3/1 1/69< =3/3 1/97) are added to sel„. Table 
2 shows the state of the list of selection predicates 650 after 
step 540. 

40 

TABLE 2 


List of Selection Predicates 


Element Contents 


1 (receive date - 3/11/69), (receive date >- 1/1/68) and 
(receive date <- 3/31/69), (3/11/69 >= 1/1/68) and 
(3/11/69 <= 3/31/69) 

2 (receive date - 3/11/69), (receive date >- 1/1/69) and 
(receive date <= 3/31/70). (3/11/69 >= 1/1/69) and 

50 (3/11/69 <= 3/31/70) 

3 (receive date - 3/11/69), (receive date >- 1/1/70) and 
(receive date <= 3/31/71), (3/11/69 >» 1/1/70) and 
(3/11/69 <- 3/31/71) 

n (receive date - 3/11/69), (receive date >» 1/1/96) and 

55 (receive date <- 3/31/97), (3/11/69 >- 1/1/96) and 

(3/11/69 <= 3/31/97) 


Upon evaluating the new constant predicates of selj, the 

result is TRUE, for seL^ the result is TRUE, for sel 3 the result 
60 is FALSE, and for sel n the result is FALSE. Therefore, in this 

example, only partition 1 and partition 2 will be scanned. 
FIG. 7 is an illustration of a database management system 

using check constraints to implement overlapping partition 

keys according to another embodinent of the present inven- 
65 tion. In this embodiment, the data dictionary 620 has stored 

therein check constraints 730 and per-partition constraint 

template values 780. 
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According to one embodiment, check constraints 730 are the lower bound of partition 2, the upper bound of partition 

used to enforce the correlation between the OPkey and the 2 is the lower bound of partition 3, etc. Also, with this 

partitioning key for the entire partitioned table 660. The implementation, there can be no gaps between consecutive 

check constraints 730 can be represented with predicates partitions. For example, given the constraints above with 

containing constraint templates. The constraint templates act 5 X^IO, X2=20, and X 3 =30, all values between 10 and 20 are 

as place holders in the predicates that are replaced with a mapped to partition 1, and all values between 20 and 30 are 

corresponding per-partition value during the compile phase. mapped to partition 2. 

For example, the check constraint predicates for FIG. 3 FIG. 9 is an illustration of a database management system 

might be expressed as: (receive date>»ship date) and using partition value arrays. The data dictionary 620 has 

(receive date<=ship date+days), where days represents a 10 stored therein two independent partition value arrays, X,- 910 

constraint template. While the potential exists for each and Y ( 920, and an array of partitioning predicates, P, 930. 

partition to have a different value for the constraint template, To increase the flexibility of partitioning keys the upper 

in this example, all the partitions have a constraint template and i owe r boundaries of partitions are independent. This is 

value of 90 days. accomplished, in one embodiment, using two partition value 

^ 15 arrays, X, 910 and Y. 920 each having n boundary values, to 

Index-based Implemention of Overlapping \ ■ ,u * ♦ a t • 

„ constrain the partitions instead of using consecutive array 

locations in the same partition value array. In another 

Indices are generally used to speed up searches in the embodiment, one partition value array having 2n locations 

column or columns of a table that are indexed. An index can be used. One portion of the array stores the upper 

provides a mapping between data values to the row or rows 20 boundary values and another portion stores lower boundary 

in the table containing those data values. An index can be values. 

implemented with a bash table, an ordered list, a B-tree, and Independent upper and lower bounds allow consecutive 

many other alternative structures. An index can be either partitions to overlap; therefore, in this embodiment, P,. can 

local or global. A global index is one structure pointing to be TRUE for more than one partition. Thus, a given value in 

rows in every partition. A local index is actually a set of 25 the partitioning key could potentially be mapped to more 

indices, one per partition, where the index for each partition than one partition. Further, in this embodiment, gaps can 

points at rows only in its partition. exist between consecutive partitions. For example, given the 

FIG. 8 is an illustration of a database management system constraints X^-P^-Y, where Xj-10, Yj-15, X 2 -20, 

using an index to implement overlapping partition keys Y 2 -30, X 3 -25, and Y 3 -35, all values between 10 and 15 are 

according to one embodiment of the present invention. In 30 mapped to partition 1, values between 20 and 30 might be 

this embodiment, the data dictionary 620 has stored therein in partition 2 or partition 3, and a gap exists between the 

an index 830. values 15 and 20. 

In one embodiment, the columns of the OPkey are When a row to be inserted in the parutioned table contains 

indexed by the local index 830. The local index 830 contains a partitioning key value that maps to more than one partition, 

a set of indices. Each index within the set contains refer- a policy must be provided to determine to which of the 

ences to all the data values in the OPkey columns for each possible partitions the row should be inserted. Such a policy 

partition in the partitioned table 660. may be, for example, to always insert the row in the first 

In another embodiment the indices track at least high and qualifying partition that is encountered. An alternative 

low values per partition. In this embodiment, the per- 40 P ohc y ma y be to require a user to select among the quali- 

partition predicates (OPkey<-:high) and (OPkey>-:low) are ^ partitions. The present invention is not limited to any 

appended to the list of selection predicates 650 instead of or Particular policy for selecting among qualifying partitions, 

in addition to check constraint predicates. At run-time, the In thc foregoing specification, the invention has been 

high and low values can be fetched from the index and the described with reference to specific embodiments thereof. It 

values can be bound to the generated predicates in the list of 45 ^ however, be evident that various modifications and 

selection predicates 650. In yet another embodiment, no changes may be made thereto without departing from the 

index is maintained, but high and low values are tracked for broader spirit and scope of the invention. The specification 

each partition and can be fetched and bound at run-time as and drawings are, accordingly, to be regarded in an illus- 

in the prior embodiment, trativc ra t thcr than a restrictive sense. 

5Q What is claimed is: 

Overlapping Partitions 1. A method for executing queries that specify data from 

Partitioning is based on each partition, i, having a parti- a sct of data ^ at ha * bccD P artitioned into a P 1 ™ 1 */ of 

tioning predicate, P,, P ( - can be a boolean expression of P^ons based on a first key the method comprising the 

terms, T iy (e.g.,T a and TJ. T„ is a comparison involving the oom P uter lamented steps of: 

partitioning key and a constant (e.g., ship date>= 1/1/69). It 55 receivm g a 9 ucr y ^ includes a reference to a second 

is customary that for every primary partitioning key, P- is kev > wherein said second key is not part of said first key 

TRUE for at most one partition, i, where l<-i<-n. That is, but has a predetermined correlation with said first key; 

there exists a one-to-one or many- to -one mapping between selecting a subset of said plurality of partitions to scan 

data values and partitions. A given value in the partitioning based on said reference to said second key and said 

key is mapped to only one partition in the tabic. eo predetermined correlation with said first key; and 

Strict partitioning is generally implemented with n-1 executing said query by scanning only those partitions of 

boundary conditions. The n-1 boundary conditions define said plurality of partitions that belong to said subset of 

the upper and lower boundaries for n partitions. One limi- partitions. 

tation resulting from strict partitioning is the dependence of 2. The method of claim 1 further comprising the steps of: 

a given partition's boundaries on the boundaries of the 65 determining a set of query predicates from said query; 

preceding and succeeding partitions. For example, given the adding said set of query predicates to a set of selection 

constraints X / <e»P i <oX /+1 , the upper bound of partition 1 is predicates; 
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adding one or more predicates to said set of selection 
predicates, said one or more predicates based on said 
predetermined correlation with said first key; and 

using said set of selection predicates to select said subset 
of said plurality of partitions to scan. 5 

3. The method of claim 2 further comprising the steps of: 
after said step of adding one or more predicates to said set 

of selection predicates, transitively generating new 
predicates based on said set of selection predicates; 
adding said new predicates to said set of selection predi- 
cates; 

removing from said set of selection predicates those of 
said one or more predicates and said new predicates 
that cannot be evaluated in constant time; 15 

removing from said set of selection predicates those of 
said new predicates that can produce an unknown 
result; and wherein 

said step of selecting a subset of said plurality of 
partitions to scan includes 2 o 
evaluating constant predicates in said set of selection 

predicates, and 
including in said subset those partitions correspond- 
ing to selection predicates having constant predi- 
cates that evaluate to a predetermined state. 25 

4. The method of claim 2 further comprising the steps of: 
excluding from said query plan those partitions of said 

plurality of partitions corresponding to selection predi- 
cates having constant predicates that evaluate to a 
predetermined state; and 30 
including in a query plan only those partitions of said 
plurality of partitions corresponding to selection predi- 
cates having constant predicates that do not evaluate to 
said predetermined state. 

5. The method of claim 1 further comprising the steps of: 35 
receiving a request to modify a subset of data in said set 

of data, wherein said subset of data includes one or 
more values for said first key and one or more values 
for said second key; 

determining whether said one or more values for said 
second key have said predetermined correlation with 
said one or more values for said first key; 

if said one or more values for said second key have said 
predetermined correlation with said one or more values 45 
for said first key, then selecting a target partition based 
on said one or more values for said first key, and storing 
said subset of data in said target partition; 

if said one or more values for said second key do not have 
said predetermined correlation with said one or more 50 
values for said first key, then generating an error 
message without adding said subset of data to said set 
of data. 

6. The method of claim 1 further comprising the step of, 
for each partition of said plurality of partitions, generating 55 
data that indicates values of said second key that satisfy said 
predetermined correlation with values for said first key that 
correspond to said partition. 

7. A method of producing query plans for executing 
queries on a set of data that has been partitioned into a ^ 
plurality of partitions based upon a first key, the method 
comprising the computer implemented steps of: 

receiving a query; 

selecting a subset of partitions to scan from said plurality 
of partitions, the selection being performed by 65 
if said query includes a reference to a second key and 
does not refer to said first key, then determining said 


40 


subset of partitions to scan based on said reference to 
said second key and a predetermined correlation 
between said second key and said first key, wherein 
said second key is not part of said first key; and 
producing a query plan which includes only those parti- 
tions of said plurality of partitions that belong to said 
subset of partitions. 

8. The method of claim 7 wherein said step of selecting a 
subset of partitions to scan further comprises the steps of: 

if said query includes a reference to said first key and does 
not refer to said second key, then deterrnining said 
subset of partitions to scan based on said reference to 
said first key and said first key; and 

if said query includes a reference to both said first and 
second keys, then determining said subset of partitions 
to scan based on said reference to both said first and 
second keys and said predetermined correlation. 

9. The method of claim 7 further comprising the steps of: 
receiving a request to modify said set of data, wherein 

said request includes a subset of data having one or 
more values corresponding to one or more attributes 
from said first key and one or more values correspond- 
ing to one or more attributes firom said second key; and 
enforcing said predetermined correlation between said 
second key and said first key by 
applying a first set of predicates to said subset of data 

to determine whether or not said subset of data 

complies with said predetermined correlation, 
if said subset of data complies with said predetermined 

correlation, then allowing said request, and 
if said subset of data does not comply with said 

predetermined correlation, then disallowing said 

request. 

10. A method for executing queries that specify data from 
a set of data that has been partitioned into a plurality of 
partitions based on a first key, the method comprising the 
computer implemented steps of: 

receiving a query that includes a reference to a second 
key; 

accessing one or more predicates, wherein said one or 
more predicates represent a predetermined correlation 
between said first key and said second key; 

selecting a subset of said plurality of partitions to scan 
based on said reference to said second key and said one 
or more predicates; and 

executing said query by scanning only those partitions of 
said plurality of partitions that belong to said subset of 
partitions. 

11. The method of claim 10 wherein said one or more 
predicates correspond to one or more check constraints, said 
method further comprising the step of enforcing said pre- 
determined correlation between said first key and said 
second key with said one or more check constraints, said one 
or more check constraints limiting values that can be stored 
in attributes of said second key. 

12. The method of claim 10 further comprising the steps 

of: 

adding a set of query predicates derived from said query 

to a set of selection predicates; 
adding one or more predicates to said set of selection 

predicates, said one or more predicates based on said 

predetermined correlation with said first key; and 
using said set of selection predicates to select said subset 

of said plurality of partitions to scan. 

13. The method of claim 12 further comprising the steps 

of: 
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after said step of adding one or more predicates to said set 
of selection predicates, transitively generating new 
predicates based on said set of selection predicates; 

adding said new predicates to said set of selection predi- 
cates; 

removing from said set of selection predicates those of 

said one or more predicates and said new predicates 

that cannot be evaluated in constant time; 
removing from said set of selection predicates those of 1Q 

said new predicates that can produce an unknown 

result; and wherein 

said step of selecting a subset of said plurality of 
partitions to scan includes evaluating constant predi- 
cates in said set of selection predicates, and 

including in said subset those partitions corresponding 
to selection predicates having constant predicates 
that evaluate to a predetermined state. 

14. The method of claim 12 wherein said step of executing 
said query further comprises the steps of: 2Q 

including in a query plan only those partitions of said 
plurality of partitions corresponding to selection predi- 
cates having constant predicates that evaluate to a first 
predetermined state; and 

excluding from said query plan those partitions of said 25 
plurality of partitions corresponding to selection predi- 
cates having constant predicates that evaluate to a 
second predetermined state. 

15. A method for executing queries that request data from 

a set of data that has been partitioned into a plurality of 30 
partitions based on a first key, the method comprising the 
computer implemented steps of: 

receiving a query that includes a reference to an attribute 
that is part of a second key, wherein said second key is 
not part of said first key but has a predetermined 35 
correlation with said first key; 
accessing a set of values associated with said attribute; 
selecting a subset of said plurality of partitions to scan 
based on said reference to said attribute and said set of 
values; and 

executing said query by scanning only those partitions of 
said plurality of partitions that belong to said subset of 
partitions. 

16. The method of claim 15 wherein said set of values 45 
includes at least a high value and a low value for said 
attribute for each partition in said plurality of partitions, each 
high value representing the highest value present in said 
attribute within the corresponding partition, each low value 
representing the lowest value present in said attribute within 50 
the corresponding partition. 

17. The method of claim 16 wherein said set of values are 
stored as part of an index on said attribute. 

18. A method for executing queries on a set of data that 
has been partitioned into a plurality of partitions based on a 5S 
first key, wherein said first key includes one or more 
attributes, the method comprising the computer imple- 
mented steps of: 

receiving a query that includes a reference to a value from 
one of said one or more attributes, wherein a first set of so 
data containing said value is stored in a first partition of 
said plurality of partitions and a second set of data 
contaning said value is stored in a second partition of 
said plurality of partitions; 

selecting a subset of said plurality of partitions to scan 65 
based on said reference, wherein said subset includes 
said first and second partitions; and 
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executing said query by scanning only those partitions of 
said plurality of partitions that belong to said subset of 
partitions. 

19. The method of claim 18 further comprising the steps 


adding a set of query predicates to a set of selection 
predicates; 

generating new predicates based on said set of selection 
predicates and one or more correlation predicates, said 
one or more correlation predicates based on a prede- 
termined correlation of a second key with said first key; 

removing from said set of selection predicates those of 
said one or more predicates and said new predicates 
that cannot be evaluated in constant time; 

removing from said set of selection predicates those of 
said new predicates that can produce an unknown 
result; and 

using said set of selection predicates to select said subset 
of said plurality of partitions to scan. 

20. The method of claim 19 further comprising the steps 
of: 

excluding from said subset of said plurality of partitions 
to scan those partitions of said plurality of partitions 
corresponding to selection predicates having constant 
predicates that evaluate to a predetermined state; and 

including in said subset of said plurality of partitions to 
scan those partitions of said plurality of partitions 
corresponding to selection predicates having constant 
predicates that do not evaluate to said predetermined 
state. 

21. The method of claim 18 further comprising the step of, 
for each partition of said plurality of partitions, generating a 
set of predicates that indicate values for said first key that 
satisfy a predetermined correlation with values for said 
second key that correspond to said partition. 

22. A method for executing queries on a set of data that 
has been partitioned into a plurality of partitions, the method 
comprising the computer implemented steps of: 

receiving a query that includes a reference to a first key; 

accessing data that indicates upper and lower boundary 
values for said plurality of partitions, wherein said 
upper and lower boundary values of each of said 
plurality of partitions are independent of the upper and 
lower boundary values of the other of said plurality of 
partitions; and 

selecting a subset of said plurality of partitions upon 
which to execute said query based on said reference 
and said upper and lower boundary values of each of 
said plurality of partitions. 

23. A computer system comprising: 
a processor; and 

a memory coupled to said processor, said memory having 
stored therein 

a first set of data that has been partitioned into a 

plurality of partitions, 
a second set of data indicating upper boundary values 

for each of said plurality of partitions, 
a third set of data, separate from said second set of data, 

indicating lower boundary values for each of said 

plurality of partitions, and 
sequences of instructions which, when executed by said 

processor, cause said processor to select a subset of 

said plurality of partitions to scan based on a set of 

query predicates and said upper and lower boundary 

values of said plurality of partitions. 
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24. A machine-readable medium having stored thereon 
data representing sequences of instructions, said sequences 
of instructions including sequences of instructions which, 
when executed by a processor, cause said processor to 
perform the steps of: 5 

receiving a query on a set of data, wherein said set of data 
has been partitioned into a plurality of partitions based 
on a first key, wherein said query includes a reference 
to a second key, aud wherein said second key is not part 
of said first key but has a predetermined correlation 10 
with said first key; 

selecting a subset of said plurality of partitions to scan 
based on said reference to said second key and said 
predetermined correlation with said first key; and 

executing said query by scanning only those partitions of 
said plurality of partitions that belong to said subset of 
partitions. 

25. The method of claim 1 wherein the step of selecting 
the subset of said plurality of partitions to scan includes 
selecting at least two of said plurality of partitions. 

26. The method of claim 8 wherein said step of selecting 
the subset of partitions to scan further comprises the step of 
including all of said plurality of partitions in said subset of 
partitions to scan if said query does not refer to one of said 
first key or said second key. 

27. The machine- readable medium of claim 24 further 
comprising instructions for performing the steps of: 

determining a set of query predicates from said query; 
adding said set of query predicates to a set of selection 30 
predicates; 

adding one or more predicates to said set of selection 
predicates, said one or more predicates based on said 
predetermined correlation with said first key; and 

using said set of selection predicates to select said subset 35 
of said plurality of partitions to scan. 

28. The machine-readable medium of claim 27 further 
comprising instructions for performing the steps of: 

after said step of adding one or more predicates to said set 
of selection predicates, transitively generating new 40 
predicates based on said set of selection predicates; 

adding said new predicates to said set of selection predi- 
cates; 

removing from said set of selection predicates those of 
said one or more predicates and said new predicates 
that cannot be evaluated in constant time; 

removing from said set of selection predicates those of 
said new predicates that can produce an unknown 
result; and wherein 50 

said step of selecting a subset of said plurality of partitions 
to scan includes 

evaluating constant predicates in said set of selection 

predicates, and 
including in said subset those partitions corresponding 55 

to selection predicates having constant predicates 

that evaluate to a predetermined state. 

29. The machine- readable medium of claim 27 further 
comprising instructions for performing the steps of: 

excluding from said query plan those partitions of said eo 
plurality of partitions corresponding to selection predi- 
cates having constant predicates that evaluate to a 
predetermined state; and 

including in a query plan only those partitions of said 
plurality of partitions corresponding to selection predi- 65 
cates having constant predicates that do not evaluate to 
' said predetermined state. 
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30. The machine -readable medium of claim 24 further 
comprising instructions for performing the steps of: 

receiving a request to modify a subset of data in said set 
of data, wherein said subset of data includes one or 
more values for said first key and one or more values 
for said second key; 

determining whether said one or more values for said 
second key have said predetermined correlation with 
said one or more values for said first key; 

if said one or more values for said second key have said 
predetermined correlation with said one or more values 
for said first key, then selecting a target partition based 
on said one or more values for said first key, and storing 
said subset of data in said target partition; 

if said one or more values for said second key do not have 
said predetermined correlation with said one or more 
values for said first key, then generating an error 
message without adding said subset of data to said set 
of data. 

31. The machine -re ad able medium of claim 24 further 
comprising instructions for performing the step of, for each 
partition of said plurality of partitions, generating data that 
indicates values of said second key that satisfy said prede- 
termined correlation with values for said first key that 
correspond to said partition. 

32. The machine-readable medium of claim 24 wherein 
the step of selecting the subset of said plurality of partitions 
to scan includes selecting at least two of said plurality of 
partitions. 

33. A machine-readable medium carrying one or more 
sequences of instructions for producing query plans for 
executing queries on a set of data that has been partitioned 
into a plurality of partitions based upon a first key, wherein 
execution of the one or more sequences of instructions by 
one or more processors causes the one or more processors to 
perform the steps of: 

receiving a query; 

selecting a subset of partitions to scan from said plurality 
of partitions, the selection being performed by 
if said query includes a reference to a second key and 
does not refer to said first key, then determining said 
subset of partitions to scan based on said reference to 
said second key and a predetermined correlation 
between said second key and said first key, wherein 
said second key is not part of said first key; and 
producing a query plan which includes only those parti- 
tions of said plurality of partitions that belong to said 
subset of partitions. 

34. A machine-readable medium carrying one or more 
sequences of instructions for executing queries that specify 
data from a set of data that has been partitioned into a 
plurality of partitions based on a first key, wherein execution 
of the one or more sequences of instructions by one or more 
processors causes the one or more processors to perform the 
steps of: 

receiving a query that includes a reference to a second 
key; 

accessing one or more predicates, wherein said one or 
more predicates represent a predetermined correlation 
between said first key and said second key; 

selecting a subset of said plurality of partitions to scan 
based on said reference to said second key and said one 
or more predicates; and 

executing said query by scanning only those partitions of 
said plurality of partitions that belong to said subset of 
partitions. 
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35. A machine-readable medium carrying one or more 
sequences of instructions for executing queries that request 
data from a set of data that has been partitioned into a 
plurality of partitions based on a first key, wherein execution 
of the one or more sequences of instructions by one or more 5 
processors causes the one or more processors to perform the 
steps of: 

receiving a query that includes a reference to an attribute 
that is part of a second key, wherein said second key is 
not part of said first key but has a predetermined 1° 
correlation with said first key; 

accessing a set of values associated with said attribute; 

selecting a subset of said plurality of partitions to scan 
based on said reference to said attribute and said set of s 
values; and 

executing said query by scanning only those partitions of 
said plurality of partitions that belong to said subset of 
partitions. 

36. A machine-readable medium carrying one or more 2 o 
sequences of instructions for executing queries on a set of 
data that has been partitioned into a plurality of partitions 
based on a first key, wherein said first key includes one or 
more attributes, wherein execution of the one or more 
sequences of instructions by one or more processors causes 25 
the one or more processors to perform the steps of: 

receiving a query that includes a reference to a value from 
one of said one or more attributes, wherein a first set of 
data containing said value is stored in a first partition of 
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said plurality of partitions and a second set of data 

containing said value is stored in a second partition of 

said plurality of partitions; 
selecting a subset of said plurality of partitions to scan 

based on said reference, wherein said subset includes 

said first and second partitions; and 
executing said query by scanning only those partitions of 

said plurality of partitions that belong to said subset of 

partitions. 

37. A machine-readable medium carrying one or more 
sequences of instructions for executing queries on a set of 
data that has been partitioned into a plurality of partitions, 
wherein execution of the one or more sequences of instruc- 
tions by one or more processors causes the one or more 
processors to perform the steps of: 

receiving a query that includes a reference to a first key; 
accessing data that indicates upper and lower boundary 
values for said plurality of partitions, wherein said 
upper and lower boundary values of each of said 
plurality of partitions are independent of the upper and 
lower boundary values of the other of said plurality of 
partitions; and 
selecting a subset of said plurality of partitions upon 
which to execute said query based on said reference 
and said upper and lower boundary values of each of 
said plurality of partitions. 
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